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Abstract 

Both parametric distribution functions appearing in extreme value theory - the general- 
ized extreme value distribution and the generalized Pareto distribution - have log-concave 
densities if the extreme value index 7 £ [—1, 0]. Replacing the order statistics in tail index 
estimators by their corresponding quantiles from the distribution function that is based on 
the estimated log-concave density f n leads to novel smooth quantile and tail index estima- 
tors. These new estimators aim at estimating the tail index especially in small samples. 
Acting as a smoother of the empirical distribution function, the log-concave distribution 
function estimator reduces estimation variability to a much greater extent than it intro- 
duces bias. As a consequence, Monte Carlo simulations demonstrate that the smoothed 
version of the estimators are well superior to their non-smoothed counterparts, in terms of 
mean squared error. 

Keywords: "extreme value" theory; log-concave density estimation; negative Hill esti- 
mator; Pickands estimator; tail index estimation; small-sample performance 
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1 Introduction 

It is a well-known fact that asymptotic results are in general at best approximately valid 
in small-sample problems, but that in the latter situation bias is often a serious issue. For 
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example in extreme value theory the small-sample bias in the estimation of the tail index 
is severely. We refer to [6] for a study of a number of estimators. There are only a few 
more articles that focus on the small-sample performance of tail-index estimators. We are 
aware of [2J, [22J and [55]. These consider mainly tail-index estimation for heavy-tailed 
distributions. In this article we investigate the small-sample behavior of a new smooth 
tail- index estimator for thin-tailed and distributions with finite endpoint. The main aim 
of this article is to introduce the smoothed estimators that exploit the log-concavity of the 
limiting density of the exceedances or of the largest order statistics, respectively. In Section 
[2j we present the connection of log-concavity and extreme- value theory. Further, we show 
in Section [3J that replacing the empirical distribution function by the smooth estimator F n 
based on the log-concave density estimator - see ([2]) for a proper definition of F n - leads 
to novel tail-index estimators that exhibit substantially decreased mean-squared error in 
small-sample situations. We illustrate this finding in a simulation study in Section H] for 
two settings (1) a generalized Pareto distribution and (2) a domain of attraction scenario. 
The paper concludes with some brief remarks in Section 

2 Log-concavity in extreme-value theory 

2.1 Max-domain of attraction of distributions with log-concave 
densities. 

Let {X n , n > 1} be a sequence of independent random variables with common distribution 
function F, such that F belongs to the max-domain of attraction of G, denoted by F G 
T>(G), i.e. there exist constants a n > 0, b n G R such that for x G R and 

G(x) = lim P ( a" 1 • [max(Xi, . . . , X n ) + b n ] < x ) 
= lim F n (a n ■ x - b n ) 

n— >oo 

which is equivalent to 

sup \F n (a n ■ x — b n ) — G(x)\ — > 0, as n — > oo. 

From [15] it is known that F G T>{G) if and only if G G {G 7 : 7 G R}, where 

G 7 (z) = exp (-(1 + 7 x)" 1/7 ) , 1 + 7 x > 0, 

where G 7 is called the extreme value distribution with tail index 7 G R, shift parameter 0, 
and scale parameter 1. Since 

(1 + 7x) -1 / 7 — > exp(— x), for 7 — > 0, 
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interpret Gq(x) as exp(— e~ x ). The two most common settings in the analysis of ex- 
treme values are that we either have an observed sequence of independent and identi- 
cally distributed maxima, M n> i, . . . , M n> k, or upper order statistics, Xr n \ > Xr n _i\ > 
. . . > Xr n _ Kn \, from an independent and identically distributed sample X±, . . . ,X n with 
K n = k n < n or K n = #{Xi : > u n } a random number, where u n is some suitably 
chosen high threshold and #A denotes the number of elements in set A. We will focus on 
the latter setting and in general we assume that K n is an intermediate sequence that is 
K n /n — ► and K n — > oo if n — > oo. 

2.2 The Generalized Pareto Distribution and log-concavity. 

To fix notation, define for a general distribution function F the lower endpoint ot{F) := 
inf{.x E R : F(x) > 0} and the upper endpoint u>(F) := sup{x E R : < 1}. The 

quantile function of F for q 6 [0, 1] is 

:= inf{x G R : F(x) < g}. 

Exceedances X^ — u, . . . , X^ n _ kn+i ^ — u of a high threshold it = n n or of an intermediate 
order statistic u = X^ n _ kn ^ are typically modelled by the generalized Pareto distribution 
(GPD), established by Pickands (see [29]). For 7 € R and a € (0, 00) the density of the 
GPD is given by 

w^(x) := a- 1 (l + 7 x/a)-( 1+1 /7) ) xe [0,-a/ 7 ], 

where wq^ is again defined via continuity: Wq a (x) = cr" 1 exp(— x/a) for x E [0, 00). 

|28j showed that both parametric distribution functions appearing in extreme value 
theory - the generalized extreme value distribution (GEV) and the generalized Pareto 
distribution - have log-concave densities if the extreme value index 7 E [—1,0] and that all 
distribution functions F with log-concave density belong to the max-domain of attraction of 
the GEV with 7 E [—1,0]. For any distribution function F with corresponding real- valued 
log-concave density function /, this latter / can be written as 

f(x) = expc^O), (1) 

for a concave function (p : R — > [—00, 00). We will denote the class of distribution functions 
F having a log-concave density on its support 5 = [a(F),u)(F)] by ^(S). 

2.3 The restriction 7 G [—1,0] and connection to bump hunt- 
ing. 

We are aware that the restriction of log-concave densities and of 7 E [—1,0], respectively, 
is a drawback if there is not sufficient evidence to assume that this restriction holds. On 
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the other hand there are often good reasons to assume that some distribution function F 
has all its moments finite or that its support is finite, implying 7 £ [—1, 0]. Estimating the 
finite endpoint u){F) of a distribution F is linked to the problem of estimating 7 < and 
its theory is well developed, see for example [IS], [5]) [12] and JT3J. The data set of the total 
life span of people who died in the Netherlands, who were born between the years 1877- 
1881, as analyzed in [I] is a real life example that results in an estimated finite endpoint 
and an estimated tail index between —1/2 and 0. Further data sets on survival times of 
208 mice exposed to radiation and on the men's 100m running times of the 1988 and 1992 
Olympic Games are analyzed in [19]. By definition the distribution of the distance of two 
points in a closed convex set has finite support and there are many open problems regarding 
its limit behavior. For the current state of research we refer to [25]. A further example 
that naturally leads to the restriction 7 < is the estimation of the efficiency frontier in 
economics (see [14]). In practical applications, 7 = —1/2 is often seen as natural lower 
bound, e.g. p. 62 in [21] or [T3]. 

In this context, we would like to point out [3D], who proposes a multiscale procedure to 
identify collections of intervals based on an i.i.d. sample where a density is either log-concave 
or log-convex. The chosen multiscale approach ensures that the claims (i.e. log-concavity 
or log-convexity) holds for all intervals in the collections simultaneously. Additionally, 
it is shown that the method asymptotically keeps the level. This offers a way to "pre- 
assess" whether 7 actually is in [—1,0]: For a chosen significance level a £ (0,1), apply 
this novel bump hunting method to either the sample of exceedances or the whole sample 
of observations. In the first case, if the collection of intervals claiming log-convexity is 
empty and the collection of intervals claiming log-concavity at best contains an interval 
(almost) spanning the whole range of exceedances, then we could claim with asymptotic 
probability 1 — a that indeed the observations we are looking at stem from a distribution 
FeFn CX»(G 7 ), 7 G [-1,0]. 

In the second case, we get an upper bound m > k for the number of upper order 
statistics we should take into account. Just define m such that A( n _, m+1 ) is the left-most 
endpoint of all intervals whereon the bump-hunting method claims log-concavity of the 
underlying density. 

In both cases, the new tail index estimation procedures as presented in Section [3J is 
adequate. 

3 Tail index estimation 

The estimation of 7 is besides the related high quantile estimation the most important 
problem in univariate extreme value theory and there exist a vast number of different 
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approaches. For example the Hill estimator ([20]), the maximum likelihood estimator ([3], 
[T8] , [35] , [36] , [37] ) , the moment estimator ( [7J ) , the (iterated) negative Hill estimator also 
known as Falk's estimator ([H], [12], [27]), the (generalized) Pickands estimator ([29], [8], 
[M]). estimators based on near extremes ([2B]), the weighted least squares estimator (|23j). 
probability weighted moments ([21]), and many more. All these estimators are based on 
an intermediate sequence of upper order statistics and it is well known (see e.g. |17j ) that a 
major drawback of such estimators is their discrete character. Using kernel-type estimators 
is one possibility to overcome this deficiency. We refer to [5] for the smoothed Hill estimator 
in case 7 > and to [IT] for general 7 £ I. Our alternative is to take advantage of the 
distribution function F n based on the log-concave density estimator f n , which is possible 
if the true 7 is in the restricted interval [—1, 0]. 

3.1 Motivation of new estimators. 

For an i.i.d. sample X±, . . . , X n where Xi has a log-concave density function as introduced 
in ([1]), let F n be the empirical distribution function and 



be the smoothed distribution function based on the log-concave density estimator f n = 
exp ifi n . A proper definition and properties of f n are given in [30] and [10] . We only 
mention one special feature: the estimator exp (p n of the log-density is a piecewise linear 
function with knots only at some of the observations points X\, . . . , X n and (p n = —00 on 
R \ [X\ , X n ] . How to actually compute f n is detailed in [31] and [9] . 
In [30] the following theorem is proven. 

Theorem 3.1. Let X±, . . . ,X n be an i.i.d. sample stemming from a distribution with log- 
concave density f such that f = expcp is Holder- continuous for an exponent (3 6 (1,2] and 
constant L > 0. Furthermore, fi'(x) — fi'{y) > C(y — x) for C > and A < x < y < B . 
Then, as n — > 00, 



where T n — > T := [yl,!?]. Furthermore, F n (Xn^) = and F n (X^) = 1. 

This theorem implies that F n is essentially equivalent to F n , but as the integral of 
a piecewise exponential function very smooth. These properties turn out to be highly 
convenient in the estimation of the extreme value tail index 7. The smoothness of F n 
reduces the variance not only considerably in the estimation of 7 but even for the estimation 
of quantiles of the generalized Pareto distribution, as is shown in Section [H 
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Many well-known tail index estimators are based on a selection of log-spacings of the 
sample, see also |34] , The key idea is now simply to replace the order statistics (or quantiles 
of the empirical distribution function) = ¥~ l (i/n), i = 1, . . . , n in these log-spacings 
by quantiles received via F n . This yields modified versions of the uniformly minimum 
variance unbiased estimator from Falk ([TT], [12]) for the case of a known endpoint, the 
negative Hill estimator as defined in [12] and Pickands' estimator ([29]) for the case of a 
unknown endpoint. We will denote these new estimators as "smoothed estimators". We 
choose the first two estimators because of their outstanding performance for 7 < and 
7 < —1/2, respectively. On the other hand it is well known that Pickands' estimator is 
not efficient and in addition it is able to estimate any 7 G 1, thus the comparison of the 
original to the smoothed version is not entirely fair. However, Pickands' estimator serves as 
the building block for much more efficient generalized Pickands' estimators that are more 
general linear combinations of log-spacings of order statistics (|34j). 



3.2 Global and tail behavior. 

Extreme value theory is, as the name suggests, tail focused. Hence, the behavior of the 
conditional distribution F x \x>in where u — > uj(F), dominates the limit results. On the 
other hand, the log-concavity of / = F' is a strong assumption on the entire shape of the 
distribution function F. If this strong assumption holds then the smoothing of the tail 
index estimators should be based on F n . Since tail index estimators use information of the 
upper tail of F n it was sufficient that the upper tail of F had log-concave density. Therefore, 
we investigate two settings in our simulation study in Section 01 (1) X\, . . . , X n \\dC{X) G 
Fn(S). Here the data is sharpened by F n . (2) X u . . . , X n MC(X) G £>(G 7 ;7 G [-1,0]). 
In this more general situation F is not necessarily in T^S) . Before the smooth tail index 
estimators are computed, the range of log-concavity of / has to be determined; either using 
bump-hunting or by imposing some assumptions on F. Here the data is sharpened by F m , 
where the smoothed distribution function is based on the m > K n largest order statistics 
and m is such that F x \ x>Xn _ m G Ff\(S fl [X n _ m , 00)). 



3.3 Smooth tail index estimators. 

First, let us pin down some notation. Suppose we are given a sample X±, . . . , X n from 
a GPD wherefrom we know that 7 ^ [ — 1 ^ 0] and with empirical distribution function F n . 
Denote the order statistics by X^ , . . . , X^ . For such a fixed sample, define for k = 4, . . . , n 
and H G {F n ,F n }: 

^ (m 1 / H~H(n - r k (H) + l)/n) - H~\{n - 2r k {H) + l)/n) x 

7pickl J log2 & \H~ 1 ((n-2r k (H) + l)/n)-H- 1 ((n-4r k {H) + l)/n)J , 
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where 

/m f[k/A\ if H = ¥ n , 
r k (H) = < 

fc/4 if H = F n , 

v 

with [m\ := max{n G No : n < m}. This construction not only exploits the superiority of 
the quantile estimates based on the smooth function F n , but also avoids "rounding bias". 
Using the inverse of a continuous distribution function, quantiles do not coincide for four 
consecutive fc's (order statistics), as it is the case for Pickands' original estimate. 

To generalize the estimators in and [12], no discrimination regarding continuity of 
H is necessary. For H G {¥ n ,F n } let 

<m 1 Y-i f X(n)-H~\(n-j + l)/n) ^ 

= fe3lE lQ g( X{n) -H-H(n-k)/n) fe = 3. • • • ~ 1 

k 



k 1 ^ ( u{F)-H-\{n-j + l)/n) 

7mvue(#) - kj^ 0g \ u;{F)-H^{{n-k)/n) 



,n 



where F is the true distribution function of the X^s. Note that 7mvue(-^0 * s om Y consistent 
if 7 G [—1,0). The chosen terminology reminds of the fact that when choosing H = F n , the 
above estimators boil down to Pickands', Falk's, and Falk's MVUE estimator as discussed 
at the beginning of this section. 

The new, smooth tail index estimators are now simply 7p ic ^(F n ), 7Faik(-^™)' an< ^ Tmvue(-^i)- 
Figure [T] displays Hill plots for two GPD pseudo-random samples, i.e. plots of the estima- 
tors versus the number of order statistics k, for the smoothed and unsmoothed versions for 
n = 64 and 7 G {-0.1, -0.75}. 

The smoothed estimators behave much more stable as a function of k and it is es- 
pecially noteworthy that for the two generated data sets all three smoothed estimators 
7pick(^«)' 7Falk(-^)' TmvtjeC^.) e hM] for evei T k - % construction, 7 Pick : W 1 -» R 
and 7Faik)7MVUE : R n [— 00, 0) which means that non permissible estimates outside the 
interval [—1, 0] potentially occur. However, due to consistency of ¥ n and F n , this is asymp- 
totically negligible. If in practice 7. £ [— 1,0] then, a truncation to its closest boundary 
value is recommendable if it is known that 7 G [—1,0]. 

It is beyond the scope and not the primary goal of this article to discuss the asymptotic 
behavior of the smoothed tail index estimators. 
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Pickands' estimator 



Falk's estimator 



Falk's MVU estimator 




Figure 1: Hill plots for n = 64 and 7 = —0.1 (plots in the upper row) and 7 = —0.75 (plots in 
the lower row), smoothed (-) and original (- -) versions. 

3.4 Further shape constraints. 

Straightforward computation yields the following lemma. 

Lemma 3.1. The density io 7iCT has the following qualitative properties which do not depend 
on the value of the scale parameter a: 

property parameter range 

convex non- decreasing 7 < — 1 

concave non-increasing 7 G [—1,-1/2] 

convex non-increasing 7 > —1/2 

log-concave 7 G [—1,0] 

log-convex 7 G (—00, —1] U [0, 00) 

Table 1: Form of w 1}(7 as determined by the tail index 7. 
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Since e.g. a density estimator f n in the class of convex decreasing densities is available, 
see [16] , the latter lemma raises the possibility to define smooth estimators of the tail index 
for other ranges of 7. However, especially in the latter case, slight modifications may be 
necessary, since F n (Xr n \) 7^ 1. Furthermore, maximum likelihood estimators under other 
constraints may not be as smooth as F n , since e.g. the estimator of a convex decreasing 
density is piecewise linear, whereas for log-concave densities this form appears for the 
estimator of the log-density. 

3.5 Computational details. 

These new smoothed estimators are made available in the R-package smoothtail (sec |33j). 
This latter package depends on the package logcondens ([32]), which offers two algorithms 
for the (weighted) estimation of an arbitrary log-concave density from an i.i.d. sample of 
observations. Both these packages are available from CRAN. 

4 Simulations 

4.1 Estimation of quant iles. 

The computation of the non-smoothed estimators TpaikO^)' ^PickO^) ano - 7mvueO^O heav- 
ily relies on the order statistics X/^,i = 1, . . . , n. But, these simply estimate the quantiles 
W~p(i/n) of the distribution whereof we want to estimate the tail index. Therefore, the 
accuracy of the tail index estimators is closely connected to the ability of estimating these 
quantiles W~^.(i/n). To illustrate the superiority of log-concave quantile estimation over 
simply taking order statistics, we calculated the relative efficiency of these two estimators. 
Since results were similar over an extended range of it's, we concentrate on the case a = 1. 

To fix notation, define the log-concave estimate Xtj\ = F~ (i/n) of an order statistic 
-X'(i), for i = 1, ... ,n. Let denote either X^ or X^, then its estimated variance and 
bias with respect to the i/n-quantile of a GPD W 7i i for a fixed 7 E [—1, 0] given simulated 
qn)j that are based on M generated samples X^ j, . . . , Xr n \ j, j = 1, . . . , M of size n drawn 
from W 7) i is defined as follows: 

M M 2 

Var(? (i) ,M) := (M - l)" 1 £ (q {i)d - (1/M) 

3=1 j=i 

M 

Bias(g (i) , 7 ,M) := M" 1 ( £ - W^{i/n). 

3=1 

The relative efficiency p-y t n,M(k) of log-concave quantile estimation to quantile estimation 
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based on order statistics is then 

[BiTs(X (fe) , 7 ,M)] 2 +Var(X (fc) ,M) 

P~f,n,M\ k ) = — — =— . 

[Bias(X (jt) , 7 ,M)]2 + Var(X (jt) ,M) 

Figure [2] details /? 7 ,32,iooo(&) f° r 7 £ {—1,-0.75,-0.5,-0.25,0} as a function of k. 
Relative efficiencies smaller than 1 are in favor of the log-concave quantile estimation and 
indicate its superiority. The use of the log-concave density estimator for the estimation 
of quantiles substantially reduces the variance of the estimation, due to its smoothing 
property detailed in Theorem 13. 1[ This transfers to a reduced MSE, uniformly in 7 and k, 
as is detailed in Figure [2j 




"i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 

2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 
number of order statistics 

Figure 2: Relative MSE in quantile estimation for n = 32. 



4.2 Smoothed versus unsmoothed tail index estimators. 

To assess the effect of smoothing the tail index estimators, we perform a simulation study 
for two settings. In Setting 1, we draw M = 1000 samples of size n\ = 64 from a GPD with 
(7 = 1 and the extreme value tail index 7 6 {—1, —0.75, —0.5, —1/3, —0.25, —0.1}. For every 
k, the log-concave density is estimated based on the full sample -^(1), ■ ■ • ,X( n y Setting 2 
consists of M = 1000 samples from a (3{9\, ^-distribution having density 
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The upper tail of the /3-distribution is dominated by # 2 , since for x f 1 we have that 
fe u e 2 (x) = c(0i,0 2 ) • w_ e -i e -i(x)[l + o(l)}. Thus, /0(0i,0 2 ) G T>(G_ 1/(h ). We fix ^ = 1/2 
and thus only the upper tail of fo.5,e 2 is log-concave. For the simulations, we choose 
62 = — G {1,4/3,2,3,4,10} and n 2 = 128 which equals deliberately 2ni in order to 
underline the difference between the two settings. Here the log-concave density estimator 
is based on the largest n 2 /2 = 64 order statistics. In both settings we present results for 
a single sample size only because the results for ni,n 2 /2 G {32,128,256,512} are very 
similar. 

Setting 1 represents the "ideal" of observing pure "peak over threshold" data 

X u ...,X n M£(X) = GPD G ?n(S). 

Alternatively, Setting 2 stands for the more general situation 

X 1 ,...,X n M£(X) GP(G 7 ; 7 G [-1,0]). 

In the latter case, the well known problem of the tradeoff between bias and variance dom- 
inates the optimal choice of k whereas in Setting 1, k can be chosen as large as possible 
since we are considering the "perfect" model. 

4.3 Simulation results. 

We compute relative efficiencies for the estimation of the tail index as for quantile estima- 
tion. For Setting 1, results are displayed in Figure O If one knows that 7 G [—1,0], then 
clearly using the smoothed estimator is most worthwhile for Pickands' estimator. However, 
also the other two estimators are substantially improved, with highest gain in terms of 
MSE for small fc's. 

Figure d] sheds light on the bias- variance trade-off in Setting 2 for 62 = 3; results for 
other choices of 82 were absolutely similar and therefore omitted. The variance in estimation 
of the tail index is dramatically reduced for the smoothed Pickands' estimate. The plot 
of the bias against k confirms that all estimators are biased as expected, since the data is 
generated by ([3|) and not by the GPD with 7 G [—1, 0]. Especially for Pickands' estimator, 
the bias is a much smoother function of k for the smoothed estimators than this is the 
case for the original estimator. Figure [5] shows the computed relative efficiencies for the 
estimation of the tail index for Setting 2. The results for 7pj c k and 7Faik are similar to those 
in Setting 1, yielding the most substantial improvement for Pickands' estimator. On the 
other hand the efficiency of the smoothed and original 7mvue is almost 1, independent of 
k and the choice of 82 ■ 
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Figure 3: Relative MSE for tail index estimation for peaks over threshold data, n = 64. 

5 Conclusions 

In this article we showed that for the class of distributions F € T>{G~ f ) having log-concave 
densities or log-concave conditional densities given that the observations exceed some 
threshold u, respectively, the smoothing of the empirical distribution function by the cor- 
responding log-concave density estimator leads to improved quantile estimation uniformly 
in 7 and the quantile to be estimated and to more efficient tail index estimators if it is 
known that 7 £ [—1,0]. The reduction of the mean squared error of the smoothed tail in- 
dex estimator is not surprisingly most substantial for Pickands' estimator, given the known 
poor efficiency of the latter. The estimator F n smoothes the empirical distribution function 
based on the global property of the log-concavity of the density of the underlying distri- 
bution function F. We showed that if such a global property is present then it is of great 
value for the estimation of a tail property such as the extreme value index 7. Of course the 
price to be paid is assuming that 7 is restricted to the narrow interval [—1,0] and it makes 
only sense to use the presented smoothed estimators if there is sufficient reason for that 
assumption. However, the novel bump hunting method in [30] is a possible tool for testing 
if a log-concavity assumption holds which automatically causes the desired restriction on 7. 
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Variance Bias WISE 




1 16 32 48 64 1 16 32 48 64 1 16 32 48 64 



number of upper order statistics k 

Figure 4: Domain of attraction scenario: n = 128, 9\ = 1/2, 62 = 3, smoothed (-) and original (- -) 
versions. 



We want to stress the fact that the Hill plots in Section and the results in our simulation 
study are not based on an optimal choice of the setting parameters and are reproducible 
for a wide range of parameter values for the 0(9±, (^-distribution unless the upper tail of 
the density is log-concave. Our simulation study can be reproduced using the R-package 
smoothtail (see |33j). 



5.1 Further research. 

The results in this article raise many new open questions and problems. We only mention 
two. At first it is challenging to prove asymptotic normality for the smoothed estima- 
tors. Then, by fitting the generalized Pareto density to f n or logu> 7>cr to <p n , respectively, 
opens the door to construct novel estimators for the tail index that are worthwhile to be 
investigated in the future. 
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1 16 32 48 64 1 16 32 48 64 1 16 32 48 64 

number of upper order statistics k 

Figure 5: Relative MSE in domain of attraction scenario, n = 128. O2 equals 10, 4 and 3 in the top 
row and 2, 4/3 and 1 in the bottom row. 
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